NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding?

Zhang, Yusen; Zheng, Wenliang; Madasu, Aashrith; Shi, Peng; Kamoi, Ryo; Zhou, Hao; Zou, Zhuoyang; Zhao, Shu; Das, Sarkar_Snigdha_Sarathi; Gupta, Vipul; et al (October 2025, International Conference on Computer Vision)

Full Text Available
Navigating the Dual Facets: A Comprehensive Evaluation of Sequential Memory Editing in Large Language Models

https://doi.org/10.18653/v1/2024.acl-long.742

Lin, Zihao; Beigi, Mohammad; Li, Hongxuan; Zhou, Yufan; Zhang, Yuxiang; Wang, Qifan; Yin, Wenpeng; Huang, Lifu (January 2024, Association for Computational Linguistics)

Memory Editing (ME) has emerged as an efficient method to modify erroneous facts or inject new facts into Large Language Models (LLMs). Two mainstream ME methods exist: parameter-modifying ME and parameter-preserving ME (integrating extra modules while preserving original parameters). Regrettably, previous studies on ME evaluation have two critical limitations: (i) evaluating LLMs with single edit only, neglecting the need for continuous editing, and (ii) evaluations focusing solely on basic factual triples, overlooking broader LLM capabilities like logical reasoning and reading understanding. This study addresses these limitations with contributions threefold: (i) We explore how ME affects a wide range of fundamental capabilities of LLMs under sequential editing. Experimental results reveal an intriguing phenomenon: Most parameter-modifying ME consistently degrade performance across all tasks after a few sequential edits. In contrast, parameter-preserving ME effectively maintains LLMs’ fundamental capabilities but struggles to accurately recall edited knowledge presented in a different format. (ii) We extend our evaluation to different editing settings, such as layers to edit, model size, instruction tuning, etc. Experimental findings indicate several strategies that can potentially mitigate the adverse effects of ME. (iii) We further explain why parameter-modifying damages LLMs from three dimensions: parameter changes after editing, language modeling capability, and the in-context learning capability. Our in-depth study advocates more careful use of ME in real-world scenarios.
more » « less
Full Text Available
Automated assessment of creativity in multilingual narratives.

https://doi.org/10.1037/aca0000725

Luchini, Simone A; Moosa, Ibraheem Muhammad; Patterson, John D; Johnson, Dan; Baas, Matthijs; Barbot, Baptiste; Bashmakova, Iana; Benedek, Mathias; Chen, Qunlin; Corazza, Giovanni E; et al (March 2025, Psychology of Aesthetics, Creativity, and the Arts)

Full Text Available
Indirectly Supervised Natural Language Processing

https://doi.org/10.18653/v1/2023.acl-tutorials.5

Yin, Wenpeng; Chen, Muhao; Zhou, Ben; Ning, Qiang; Chang, Kai-Wei; Roth, Dan (January 2023, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 6: Tutorial Abstracts))

This tutorial targets researchers and practitioners who are interested in ML technologies for NLP from indirect supervision. In particular, we will present a diverse thread of indirect supervision studies that try to answer the following questions: (i) when and how can we provide supervision for a target task T, if all we have is data that corresponds to a “related” task T′? (ii) humans do not use exhaustive supervision; they rely on occasional feedback, and learn from incidental signals from various sources; how can we effectively incorporate such supervision in machine learning? (iii) how can we leverage multi-modal supervision to help NLP? To the end, we will discuss several lines of research that address those challenges, including (i) indirect supervision from T ′ that handles T with outputs spanning from a moderate size to an open space, (ii) the use of sparsely occurring and incidental signals, such as partial labels, noisy labels, knowledge-based constraints, and cross-domain or cross-task annotations—all having statistical associations with the task, (iii) principled ways to measure and understand why these incidental signals can contribute to our target tasks, and (iv) indirect supervision from vision-language signals. We will conclude the tutorial by outlining directions for further investigation.
more » « less
Full Text Available
Ultra-fine Entity Typing with Indirect Supervision from Natural Language Inference

https://doi.org/10.1162/tacl_a_00479

Li, Bangzheng; Yin, Wenpeng; Chen, Muhao (January 2022, Transactions of the Association for Computational Linguistics)

Abstract The task of ultra-fine entity typing (UFET) seeks to predict diverse and free-form words or phrases that describe the appropriate types of entities mentioned in sentences. A key challenge for this task lies in the large number of types and the scarcity of annotated data per type. Existing systems formulate the task as a multi-way classification problem and train directly or distantly supervised classifiers. This causes two issues: (i) the classifiers do not capture the type semantics because types are often converted into indices; (ii) systems developed in this way are limited to predicting within a pre-defined type set, and often fall short of generalizing to types that are rarely seen or unseen in training. This work presents LITE🍻, a new approach that formulates entity typing as a natural language inference (NLI) problem, making use of (i) the indirect supervision from NLI to infer type information meaningfully represented as textual hypotheses and alleviate the data scarcity issue, as well as (ii) a learning-to-rank objective to avoid the pre-defining of a type set. Experiments show that, with limited training data, LITE obtains state-of-the-art performance on the UFET task. In addition, LITE demonstrates its strong generalizability by not only yielding best results on other fine-grained entity typing benchmarks, more importantly, a pre-trained LITE system works well on new data containing unseen types.1
more » « less
Full Text Available

Search for: All records